Memory Subsystem Design for Multithreaded Processors
نویسندگان
چکیده
Multithreading processors pose new challenges and new opportunities for cache/memory hierarchy design. Multithreading significantly alters the data reference stream seen by the memory subsystem. Multithreading also demands very different performance characteristics from the cache hierarchy than a typical (uniprocessor) CPU. This paper is specifically concerned with memory hierarchy design considerations for simultaneous multithreading (SMT) processors [7]. While multithreading processors may have created a new memory throughput crisis, their latency-tolerance also provides an opportunity. We can trade memory latency in various ways to buy increased throughput-tolerance. In this paper, we examine several ways to trade latency for throughput, as well as other mechanisms for increasing throughput-tolerance. We examine memory associativity, hierarchical cacheing, and cache block size in light of these new priorities. We also consider new cache replacement as a way to reduce the L1-L2 bus utilization and implicitly to increase the over all throughput of the system.
منابع مشابه
The Use of Multithreaded Processors in DASH
DASH is a scalable shared memory multiprocessor architecture which employs directory based cache coherence. Due to the physical distribution of the memory in DASH, there is potential for long memory latency. Although a number of latency reducing and hiding techniques such caching, a weaker memory consistency model, and pre-fetching are employed by DASH, the use of Multithreaded processors is no...
متن کاملOperating System Scheduling for Chip Multithreaded Processors
This dissertation addresses operating system thread scheduling for chip multithreaded processors. Chip multithreaded processors are becoming mainstream thanks to their superior performance and power characteristics. Threads running concurrently on a chip multithreaded processor share the processor’s resources. Resource contention, and accordingly performance, depends on characteristics of the c...
متن کاملCacheless Instruction Fetch Mechanism for Multithreaded Processors
The speed difference between processors and memories has become to one of the biggest problem in designing memory systems. While this primarily limits fast sequential access to data in memory it also sets constraints to efficient instruction fetch. In computers using single threaded processors this latter problem has traditionally been partially solved by using instruction caches, but in fast m...
متن کاملAnalysis of communications and overhead reduction in multithreaded execution
In a multithreaded execution, each thread can be thought of as running on its own virtual processor, with several virtual processors multiplexed onto a single physical processor. At any given time, some of these virtual processors are either sending or waiting for messages. When the degree of multithreading is high, there is a high potential load on the interconnection network. It is important ...
متن کاملRunning Parallel Applications on an Mp with Multithreaded Superscalar Processors Running Parallel Applications on a Mp with Multithreaded Superscalar Processors
With lesser returns on adding more complexity to conventional superscalar processors, simultaneous multithreaded (SMT) superscalar processors seem to be a promising alternative. Unfortunately, most previous work has focused on systems running multiprogrammed loads of sequential applications. It is not clear how well these processors work in a shared-memory multiprocessor environment running par...
متن کامل